Rule?based preprocessing for data stream mining using complex event processing

نویسندگان

چکیده

Data preprocessing is known to be essential produce accurate data from which mining methods are able extract valuable knowledge. When constantly arrives one or more sources, techniques need adapted efficiently handle these streams. To help domain experts define and execute tasks for streams, this paper proposes the use of active rule-based systems and, specifically, complex event processing (CEP) languages engines. The main contribution our approach formulation procedures as detection rules, expressed in an SQL-like language, that provide a simple way manipulate temporal data. This idea materialized into publicly available solution integrates CEP engine with library online mining. evaluate approach, we present three practical scenarios rules preprocess streams aim adding information, transforming features handling missing values. Experiments show how effective language express modular high-level manner, without significant time memory overheads. resulting do not only improving predictive accuracy classification algorithms, but also allow reducing complexity decision models needed learning some cases.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Complex Event Processing and Data Mining for Smart Cities

Complex Event Processing (CEP) is emerging as a new paradigm for continuous processing of streaming data in order to detect relevant information and provide support for timely reactions. The main role of a CEP engine is to detect the occurrence of event patterns on the incoming streaming data. However, the problem of discovering the event patterns, although strongly related to the data mining f...

متن کامل

Mining Co-location Patterns from Spatial Data Using Rulebased Approach

Co-location pattern is a group of spatial features/events that are frequently co-located in the same region. The co-location pattern discovery process finds the subsets of features frequently located together. Co-location rules are identified by spatial statistics or data mining techniques. A co-location algorithm has been used to discover the co-location patterns which possess an ant monotone ...

متن کامل

Retractable Complex Event Processing and Stream Reasoning

Complex Event Processing (CEP) deals with processing of continuously arriving events with the goal of identifying meaningful patterns (complex events). In existing stream database approaches, CEP is manly concerned by temporal relations between events. This paper advocates for a knowledge-rich CEP with Stream Reasoning capabilities. Secondly, we address the problem of revision in event processi...

متن کامل

Using data-stream and complex-event processing to identify activities of bats

Ground nodes receive the signals transmitted by the mobile nodes. They are also sensor nodes that run on batteries. Currently, all their detections are forwarded to a central base station, where a localization method integrates them into a position estimation for each bat [NKD15]. The base station is a standard computer with sufficient power and energy. In the future, some parts of the localiza...

متن کامل

Stream Data Processing: A Quality of Service Perspective - Modeling, Scheduling, Load Shedding, and Complex Event Processing

Imagine that you get such certain awesome experience and knowledge by only reading a book. How can? It seems to be greater when a book can be the best thing to discover. Books now will appear in printed and soft file collection. One of them is this book stream data processing a quality of service perspective modeling scheduling load shedding and c. It is so usual with the printed books. However...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Expert Systems

سال: 2021

ISSN: ['0266-4720', '1468-0394']

DOI: https://doi.org/10.1111/exsy.12762